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Abstract — Consider many instances of an arbitrary quadripar- 
tite pure state of four quantum systems AC BR, Alice holds the 
AC part of each state, Bob holds while R represents all other 
parties correlated with ACB, Alice is required to redistribute the 
C systems to Bob while asymptotically retaining the purity of the 
global states. We prove that this is possible using Q qubits of com- 
munication and E ebits of shared entanglement between Alice 
and Bob provided that Q > \l{R] C\B) and Q + ^ > H{C\B). 
This matches the outer bound for this problem given in [1]. 
The optimal qubit rate provides the first known operational 
interpretation of quantum conditional mutual information. We 
also show how our protocol leads to a fully operational proof of 
strong subaddivity and uncover a general organizing principle, in 
analogy to thermodynamics, which underlies the optimal rates. 



I. Introduction 

THE most fundamental situation in communication theory 
consists of a two-terminal source coding problem. Here 
one user, say Alice, attempts to describe a source of informa- 
tion to another user, who we call Bob. If the information source 
is modeled by a sequence of independent and identically 
distributed (i.i.d.) random variables X, one can ask for the 
ultimate rate at which the source can be described, in units 
of bits per sample. It is required that Alice's description 
allows Bob to perfectly recreate the source sequence with 
high probability, although decreasing the error probability 
generally requires block coding on longer source sequences. 
According to Shannon's noiseless channel coding theorem [2], 
this ultimate rate is given by the Shannon entropy 

H{X) = — y^p(x) \ogp{x). 

X 

Intuitively, Shannon entropy can be understood as a measure of 
the information contained in the random variable X. Because 
Shannon entropy answers the question regarding the optimal 
rate for data compression, one says that the corresponding 
protocol for data compression provides an operational inter- 
pretation of Shannon entropy. 

Suppose now that Bob had some a priori information 
regarding the random variable X, in the form of a correlated 
random variable Y. In this case, Slepian and Wolf demon- 
strated [3] that Alice would only need to send to Bob at a rate 
given by the conditional entropy 

H{X\Y) = H{XY) - H{Y) 
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and that surprisingly, Alice would not need to know Bob's side 
information to accomplish this task. The so-called Slepian- 
Wolf protocol for data compression with side information 
provides an operational interpretation of conditional entropy. 
Intuititively, one thinks of H{X\Y) as a measure of the 
information that is to be gained by learning X for one who 
already knows Y. Note that there is no advantage if Alice 
has additional side information regarding X, and that shared 
common randomness between Alice and Bob is also of no 
help. 

In this paper, we provide a complete solution to a general 
quantum counterpart of the above scenario. We find that, in 
contrast to the classical case, additional Alice side information 
changes the problem, while quantum mechanical entanglement 
between Alice and Bob, the quantum analog of shared com- 
mon randomness, is a useful resource. Our problem is fully 
quantum in a sense introduced by Schumacher [4], where 
Alice is asked to transfer part of a pure quantum state to 
Bob, while preserving the purity of the global state. For this, 
we consider a pure state of four quantum systems |?/;)^^^^. 
Initially, the A and C systems are held by Alice, while B is in 
the possession of Bob. We refer to R as the reference system 
and suppose that it is held by a third party, say Robert, and is 
inaccessible both to Alice and Bob. We determine the cost for 
Alice and Bob to redistribute the state, so that it is Bob who 
holds C instead of Alice, thereby transferring the quantum 
information in C to Bob. More specifically, we analyze the 
corresponding asymptotic scenario, asking that many copies 
of the same state be redistributed as above, while requiring 
that the redistributed states have arbitrarily high fidelity with 
the originals in the asymptotic limit. 

To achieve this task, we allow the use of two fundamental 
quantum mechanical resources. First, Alice may send qubits 
(two-level quantum systems) to Bob over a noiseless quantum 
channel. Second, we allow Alice and Bob to use pre-existing 
entanglement, shared between themselves in the form of Bell 
states 

|$+) = i=(|00) + |ll)). 

In the sequel, we refer to such a state as an ebit (entangled 
bit). We therefore phrase the asymptotic cost of redistributing 
C to Bob in terms of the number Q of qubits sent and the 
number E of ebits consumed, per copy of the state. We allow 
the entanglement cost E to be negative, in which case the 
corresponding protocol generates entanglement rather than 
consuming it. Our main result (Theorem [T]) states that it is 
possible to redistribute the state |?/^)^^^"as above if and 
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(FQRS) [10], [11], fully quantum Reverse Shannon (FQRS) 
[10], [11], and state merging [7], [12] protocols. 

The paper is organized as follows. In the next subsection we 
fix our notational conventions. The following section gives an 
introduction to the resource calculus, and a formal statement of 
the main result. Theorem [T] which is proved in Section [lll| In 
Section |lVj we show how our results yield a fully operational 
proof of strong subaddivity which, unlike previous operational 
proofs, is logically independent even from the subadditivity of 
entropy. We conclude with a discussion in Section [V| where we 
reflect on the main result and provide a novel thermodynamic 
interpretation of the optimal rates. 



Fig. 1. The shaded region represents contains the cost pairs from Q at 
which it is possible to redistribute the C part of I^^^C"^^ from AHce to 
Bob. The figure corresponds to the case /(C; A) > I{C] B); otherwise the 
corner point, which corresponds to the optimal cost pair in j2j, would be in 
the upper-left quadrant. 



only if 



Q > \I{R- C\B), Q^E> H{C\B). 



(1) 



This region is depicted in Figure [T] The quantities in these 
bounds, conditional mutual information and conditional en- 
tropy, are defined in Section |I-A| Simultaneously minimizing 
the qubit rate Q and the total sum rate Q^E gives the optimal 
cost pair 

Q'^ = \I{R-C\B) E'^ = \I{C-A)-\I{C-B). (2) 

The optimal qubit cost gives the first known operational 
interpretation of quantum conditional mutual information. In 



Section |IV| we show that Q* cannot be negative, which leads 
to an operational proof of the celebrated strong subadditivity 
inequality [5]. This proof differs from other such operational 
proofs [6], [7] in that it follows solely from a direct coding 
theorem and not from a converse proof. In [8], where our main 
result was first announced, we showed that Q* is symmetric 
under time-reversal, where now Bob redistributes C back 
to Alice, while E* is anti- symmetric. The former gives an 
intuitive understanding to the curious identity 

I{C',R\A) = I{C',R\B), 

which holds on any quadripartite pure state. We comment 
further on this feature in Section jV] We also demonstrated 
there that the corresponding protocol is perfectly compo sable. 
This constitutes an exact solution to the quantum analog of 
result of Cover and Equitz [9] on the successive refinement of 
classical information, although the classical problem is only 
known to be exactly soluble in the presence of a Markov 
condition. 

In our protocol, the systems A and B which respectively 
remain with Alice and Bob can be regarded as quantum side 
information that is used to reduce the cost of redistribution. By 
disregarding that side information in various ways, our main 
result incorporates various quantum Shannon- theoretic proto- 
cols as special cases. As we discuss in Section fv) these include 
the Schumacher compression [4], fully quantum Slepian Wolf 



A. Notational conventions 

Throughout this paper, we assume familiarity with standard 
background material in quantum information theory; for a 
general reference, the reader is referred to [13]. We use capital 
Roman letters such as A^B^C to denote Hilbert spaces. We 
write \A\ for the dimension of A and use a superscripted label 
to associate a state to a Hilbert space, by writing or \(^)^. 
Computational basis states of A are denoted with lower case 
Roman letters as in Tensor products of Hilbert spaces 

are written AB = A ^ B. Given a pure state \(p)^^ , we 
abbreviate (p^^ = \(p){(p\'^^, while writing its partial traces 
as (p^ = Tr^ (p^^ . We write tt^ for the maximally mixed 
state on A, and given two isomorphic Hilbert spaces A and 
A', we write 

for the unique maximally entangled state associated with the 
isomorphism ^ .A quantum channel is a completely 
positive, trace-preserving linear map M^^-^ from density 
matrices on A to those on B. Given an isometry V^^^, 
we will abbreviate its adjoint action on density matrices as 
V{p) = VpV^. A partial isometry is an isometry when 
restricted to its support subspace. 

For the von Neumann entropy of a density matrix (p^ we 
write 

H{A) = -Ti^^log^ 

When the underlying state could be ambiguous we write 
H{A)^. Given a multipartite state (p^^^ , various entropic 
quantities can be defined in exact analogy to the classical case 
(see e.g. [14]). Quantum conditional entropy is defined [15] 
as 

H{A\B) = H{AB) - H{B), 

quantum mutual information [15] is 

I{A; B) = H{A) + H{B) - H{AB) 

and quantum conditional mutual information is given by 

I{A; B\C) = H{A\C) + H{B\C) - H{AB\C). 

Observe that the conditional quantities above cannot generally 
be interepreted as averages, unless the conditioning system is 
purely classical. Furthermore, notice that conditional entropy 
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can in fact be negative, as is evidenced by any pure entangled 
state on AB. On the other hand, I {A; B\C) is never negative, 
a fact which is known as strong subaddivity [5]. In Section [rv] 
we show how our main result leads to a self-contained proof 
of strong subadditivity. 

II. Resource inequalities 

It will be convenient for us to use the high-level notation of 
resource inequalities [16], [17] to express our main result, as 
well as to describe various intermediate protocols introduced 
during the proof. We take a somewhat more elementary path 
than that given in [17] which is sufficient for our purposes. 

A. Finite resource inequalities 

A single ebit shared between Alice and Bob is denoted [qq]. 
The notation [q q] represents a noiseless qubit channel from 
Alice to Bob, while a noiseless classical bit channel is written 
[c ^ c] . A finite resource inequality is an expression such as 

[q^q]>[c^ c], [q^ q]> [qq] 

meaning that the resource on the left can simulate the one 
on the right. The above two examples respectively signify 
that a qubit channel can be used to send classical bits (by 
signaling with orthogonal pure states), or otherwise can be 
used to distribute entanglement (by transmitting halves of 
locally prepared ebits). Addition of two resources may be 
regarded as having each of them available. In this way, 
for instance, the existence of the quantum teleportation and 
superdense coding protocols are proofs of the respective finite 
resource inequalities 

[qq] + 2[c -^c]>[q^ q], [qq] -^[q^q]> 2[c ^ c]. (3) 

B. Approximate resource inequalities 

Given two quantum states p and a of the same quantum 
system, we may judge their closeness using either the trace 
distance — cr||-L or the fidelity F{p^a) = \\^^/a\\^. Note 
that when one of the states is pure, F{\(p)^a) = {(p\a\(p). A 
useful characterization of fidelity - Uhlmann 's theorem - says 
that if I?/;) is a purification of p, then F(p, a) is the maximum 
of over all purifications |^) of a. Fidelity and trace 

distance related by the inequalities 

F{p,a)>l-\\p-al (4) 

\\p-CTl<2y^l-F{p,CT). (5) 

Therefore, fidelity and trace distance are equivalent distance 
measures when one is interested in arbitrarily good approxi- 
mations of states are we are here. An approximate resource 
inequality 

i 3 

is a finite resource inequality which holds with an error of e in 
the following sense. Consider acting on half of a maximally 
entangled state with each target resource hj that is a channel, 
and call the resulting global state Q.. Note that Vi should also 
contain the hj that are quantum states. Now, let Vt' be the 



simulated version of this state, obtained by using the resources 
ai . We require that Vt and Vt' are e-close in either trace distance 
or fidelity. The particular measure is not important, as we 
are ultimately concerned with asymptotics, where e can be 
arbitrarily small. 

C. Asymptotic resource inequalities 

The notion of finite resource inequalities can be generalized 
to that of an asymptotic resource inequality, which is a formal 
expression of the form 

E^in««^E^iut&i- (6) 

i 3 

(i) (i) 

Here the and bj are resources and the rates R>J and R^^^^ 
are nonnegative real numbers. We shall consider the inequality 
^ to be shorthand for the following formal statement: for 
every e > 0, every set of rates R[^-^ > R^^ , R'}^^ < R^^^^ and 
all sufficiently large n, the approximate resource inequality 

i 3 

holds. Below, we use Greek letters to denote linear combina- 
tions of finite resources which appear in asymptotic resource 
inequalities. In some asymptotic resource inequalities, we may 
only require a sublinear amount o(n) of a particular input 
resource. In such cases, we write oa -\- P >z 7 if we have 
Ra -\- P y J for every R> 0. 

It will also be convenient for us extend the definition 
of asymptotic resource inequalities to have negative rates 
on the left. Such rates are interpreted as meaning that the 
corresponding resources are generated rather than consumed. 
Formally, these resources should be negated and moved to the 
right. Let us introduce two powerful lemmas which are the 
raison d'etre for the entire formalism of asymptotic resource 
inequalities and which play important roles in our proofs. 

Lemma 1 (Composition lemma [17]): 

ay P and P h J ^ a h "J- 

Lemma 2 (Cancellation lemma [17]): Given rates which 
satisfy Ri^ > Rout > 0, 

Rind ^Ph Ronta + 7 ^ (^in - Rout)a + ^ 7- 
Otherwise, if i^out > ^in > 0, then 

Rind P h Routd + 7 ^ oa + /3 ^ (i^out - Rin)ci + 7 



D. Distributed states 

In Schumacher data compression, Alice wishes to transmit 
the C parts of many instances of the state |?/^)^^ to Bob 
while asymptotically preserving the entanglement with R. We 
introduce the following notation to describe the corresponding 
coding theorem: 

^''\^^H{C)[q^q]h^^^^''^ 
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The notation ifj^^^ indicates that AHce holds the C parts of 
many i.i.d. instances of some fixed purification of the 

density matrix ijj^ , while Bob holds nothing. On the right, 
the expression il)^^^ refers to the same purifications as on the 
left, only it is Bob who is holding the C systems. In other 
words, Alice attempts to simulate identity channels from the 
systems C in her lab to identical systems C located in Bob's 
lab. This channel is only required to work well when the input 
is equal to ?/^^. In [17], the formalism of relative resources was 
introduced for these purposes, though our alternate notation 
is sufficient for our needs. State redistribution involves a 
purification l^^^^*^^ of a tripartite density matrix ip^^^ . We 
denote the distributed states before and after the protocol as 
^^c\B r^A\CB ^^^^Q Alice begins by holding AC and 
ends by only holding A. The rates in an asymptotic resource 
inequality involving such distributed states will in general be 
entropic expressions evaluated on the implicit but arbitrary 
purification into a reference system R. Using this notation, 
we again state the main result: 
Theorem 1: 



(7) 



if and only if Q and E satisfy ([T]), i.e. are contained in the 
region depicted in Figure [T] 

The converse part of the proof of Theorem [T] i.e. that Q and E 
must satisfy ([T]), is proved in [1]. We thus focus on proving a 
coding theorem showing that ^ is satisfied whenever Q and 
E satisfy ([T]). Because [q ^ q\ > [qq\, it suffices for us to 
demonstrate ^ for the corner point {Q* ^E*) defined in 

III. Proof of Theorem[T] 

To prove Theorem [T] we will demonstrate the existence of 
the following auxilliary protocol which transfers to Bob, 
which has the desired net communication and entanglement 
cost. 

Theorem 2: 



i^^^^"" + ^I{C;RB)[q 



A\CB 



q] + ll{C;A)[qq] 



lliC;B)[qq]. 



Using a coherent version of teleportation, where Alice and 
Bob apply only local unitaries, it is known that [18] 

[qq]^2[q^qq] > 2[qq] ^ [q ^ q]. 

Repeated concatenation yields the following asymptotic re- 
source inequality [18]: 



2[q^qq] h [q ^ q] ^ [qq]. 



(8) 



In fact, the opposite direction holds as a finite resource 
inequality, but it will not be useful for us here. In this paper, 
we devote most of our efforts toward proving the following 
theorem which, when combined with ([8| and the composition 
lemma (Lemma [T]), provides a proof of Theorem |2] 
Theorem 3: 



A. Proof of Theorem |j] 

Our proof of Theorem [3] relies on the following robust one- 



shot version, whose proof we delay until Section III-B 



Theorem 4 (Robust one-shot redistribution protocol): Let 
a pure state 1^^^^*^^ and a maximally entangled state 
|<l>)^^ be given, where \A\ = \B\ divides \C\. Suppose that 

XACBR 



\ACBR 



max 



{||^ 



ACBR 



are states satsifying 

ACBRW WjACBR iACBR\ 



\v\W 



Then there exist a quantum system S with IS*! = |C|/|^| 
n encoding isometrics V^^ 

y\}SBB^CBK ^^^^^ y^\iic\i 



K, encoding isometrics y^^c^AS ^ decoding isometry 



1 

hi 



fe=i 

where rj is equal to 



K , , lACBR 



ACBRiff,\AB 



\C^)AB > 1 _ 2,y (9) 



6\/e + 4 



\S\' 



\C\ 



oU^'lL^ (10) 



Together with the cancellation lemma (Lemma [2]), Theo- 
rem |2] yields a proof of Theorem [T] However, observe that 
if I{B;C) > I{A;C), the cancellation lemma still requires 
a sublinear amount of entanglement on the left. Similarly, if 
we have I{C;RB) = I{C;R) (i.e. if strong subaddivity is 
saturated), a sublinear amount of communication will also be 
required. However, because [q ^ q] > [qq], the additional 
entanglement cost can be absorbed into the communication 
rate and is therefore only relevant if the state t/j^^^ saturates 
strong subaddivity. We discuss this point further in Section \V\ 
We prove Theorem [2] by means of another protocol which 
simulates coherent channels [18]. A coherent channel [q qq] 
is a type of quantum feedback channel which is an isometry 
from Alice to Alice and Bob: 

|0)^|0)^(0|^ + |1)^|1)^(1|^. 



Now we show how to apply Theorem |4] to pure states of 
the form (^^^^^cbr^^'^ obtain a proof of Theorem [s This 
is accomplished via the following theorem. The direct part is 
proved in [11], while the converse part follows from standard 
arguments in classical information theory (see e.g. [14]). 

Theorem 5 (Method of types): Let a tripartite state {ip)^^^ 
be given. For every e, (5 > and all sufficiently large n, 
there are projections n^"", Ilf , and such that for 

Te{A,B,C}, 

2nH{T)-n5 ^ TlUj" < 2^^^^)+^'^ 

Also, the normalized version ^(p'j^'^B'^c'^ of the subnormalized 
state 
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satisfies 



ABC\^'>^\ 



ll^^^ ^ ^ - ) 111 

and for each T G {A, B, C, AB, BC, AC} 

2nHiT)-n6 ^ y 



< € 



T"^ II ^ r.nH{T)+n5 

llo — ^ 

II 2 

T"- II 



2-nH{T)-n5 ^ ||(^^"||^ < 2-^^(^)+^^ 



2-nH{T)-n5 ^ ||(^^"|| < 2"^^(^)+''^ 

The entropies in these bounds are evaluated on |?/^)^^^. 
Additionally, the normalized version ^x^'j^'^B'^c'^ of the sub- 
normalized state 



r 



satisfies 



ABC\ 



< e. 



Finally, there is a constant c > for which we may take 
e = 2~^^^ in all of the above bounds. 

Proof of Theorem^ We apply Theorem [5] two separate times 
to the state (|?/^)^^^^)*^^, obtaining two auxilliary states 
which we will use to control the main quantities appearing in 
the error bound ([TO]) of Theorem |4] For the first, we consider 
I^^ACBi? ^ tripartite state of the systems A,C,BR. 

We thus obtain, for every (5 > and all sufficiently large 
n, a state which is e-close to in trace 

distance for e = 2~^'^^ , where all of the operator norms in the 



second term in (10) have the appropriate exponential bounds. 
With respect to the partition AR^ B^C, we similarly obtain 
another state ^(jy'^^'^c'^B'^R'^ ^^j, which the operator norms in the 
last term of ([TO]) are bounded accordingly. Alice initiates the 
protocol by Schumacher compressing the system C^. For this. 



she performs the projective measurement {11^ , 1 
on C^. According to Theorem [5] the first outcome occurs 
with probability at least 1 — e. In this case, the global state 
is replaced by the normalized version 
projected state 11^'' (|?/^)^^^^)*^"^. In case the other outcome 
occurs, Alice declares an error and the protocol is aborted. 
We condition on the first case. In what follows, we identify 
1^)^"^"^"^" with its restriction \\^)A"CsB''r'' support 
Cs of the typical projection n^"" . 

By the triangle inequality, each of ^ip'^^'^c'^^'^R'^ and 
1^)^"^"^"^" is 2e-close to |^)^"C'<5^"^" in trace distance be- 
cause all three states are e-close to (|?/^)^^^^)*^^. Therefore, 
the one- shot theorem (Theorem [4]) implies that there exist a 
quantum system S and a maximally entangled state \^)^^ 
with |A| • IS*! = \Cs\, together with encoding isometrics 



and a decoding isometry W 



SB^'B^CsB^'Kou 



satisfying ^ and (10) with e replaced by 2e. If Alice applies 
one of the isometrics Vk uniformly at random and sends S 
to Bob, after which he applies W, the system Cs will be 
transferred with high global fidelity. By measuring i^out, Bob 
can, on the average, identify Alice's encoding. Rather than 
send Bob classical information, Alice can instead simulate a 
coherent channel from a system Kin to i^ini^out by applying 



a controlled isometry 



-^A'^S 



If she tries to send half of a maximally entangled state 
|<l>)^'^-, the global pure state \Q) on A'^CsB'^R^'R' Ki^Kont 
that results from the protocol is 

It is then immediate from ([9]) that 



where 



1 

is a GHZ state. Squaring this estimate and using monotonicity 
of fidelity, we obtain 

By similar reasoning, together with ([5]), we find that 

pA-CsB-R- _ ^A-CsB-R-]^^ ^ 



Because ^^"C'^B^i?'^ e-close to 
tance, the triangle inequality implies that 

II^A^C^B^i?^ (j,ACBR\®'n 



in trace dis- 



1 <2y^ + e. 



(12) 



We may combine the estimates ( pT| ) and ([12]) using a lemma 
from [19], yielding 



> 1 



> 1 - e - 2^-12?? 



')) 



(13) 



provided rj is sufficiently small. At this point, all that remains 



is to bound the two main terms in the expression (10) for r]. 
Taking IS'I = 2^^ and k, = 2^^, the first main quantity in 
([To]) satisfies 



ic^lll^ 



B'^R'^W \\ C'^B'^R'^W'^ 

lloll^ II2 



^ 2n[H{C)+H{BR)-H{CBR)-2Q]+?>n 
^ 2n[I{C;BR)-2Q]+'dn5 



(14) 



Therefore, this term goes to zero exponentially fast provided 
that 



Q> h{C;BR)^26. 



(15) 



For the second term, 

aC-B^II iuB" 



II Oil 



II 00 2"'[^+^(^^)-^(<^)-^(^)]+3"'^ 

^ 2n[R-IiC;B)]+3n6 
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SO that if R < I{C;B) — 46, this term also goes to zero 
exponentially with n. Since each of these terms will eventually 
be less than e = 2~'^^^ , we find that 

if e is sufficiently small. Therefore, the final bound on the 
fidelity in (j3 ) can be expressed as 1 — 4e^/^ for sufficiently 
small e. 

Combining ( [l5| ) with the bound on \Cs\ = Tr 11^'' obtained 
in Theorem [5] we find that the rate at which this protocol uses 
entanglement satisfies 

Because S > can be taken arbitrarily small, it follows that 
whenever 

Q > \I{C', RB), E;^ > ^/(A; C), and R < I{B', C), 
we have, for all sufficiently large n, 

>5eV« i^^^'''' + \nR\[q^qq]. 

Since this holds for arbitrarily small e > 0, the asymptotic 
resource inequality of Theorem [3] follows. ■ 

B. Proof of Theorem [?| 

Our proof of Theorem |4] makes essential use of the fol- 
lowing robust one- shot decoupling lemma, which is proved 
in the appendix. After stating the lemma, we briefly recall 
how it is used in two previously studied special cases of our 
redistribution result, to help the reader understand the context 
into which it fits with our proof. 

Lemma 3 (Robust one-shot decoupling): Let a density ma- 
trix ip^^ be given, fix e > and let (p^^ be any state 



satisfying \m 



CE 



CE\ 



C^SB 



^ < e. Fix a unitary decomposition 



of C into subsystems and define, for each 



C^C 



Then the average state 



satisfies 



BE 



TT^ (g) V 



-SBE 



U(C) 



< 2e- 



\s? 



(16) 



The fully quantum Slepian-Wolf theorem says that, given a 
tripartite pure state l^)*^-^^, 

V-^l^ + i/(C; R)[q ^q]> + i/(C; B)[qq]. 

Together with the method of types (Theorem [5]), the above 
decoupling lemma provides an immediate proof of this re- 
source inequality. Indeed, if Alice encodes with a random 
unitary. Lemma pi ensures that a system A (identified with 



C 



B 



UW 



A 



V 



lI(C;B)' 



C B 



\ \ \ 


S B 




A 




c 



Fig. 2. Circuits for fully quantum Slepian-Wolf (left) and fully quantum 
reverse Shannon (right), related by time-reversal and swapping A ^ B. Wq 
have included the rates one gets by applying the method of types (Theorem [5j 
to the one- shot decoupHng lemma (Lemma [3j. Note that for fully quantum 
Slepian-Wolf, the random encoding determines the decoding, while for fully 
quantum reverse Shannon, the random decoding determines the encoding. 



B in the lemma), which will hold Alice's half of the gen- 
erated entanglement, is approximately maximally mixed and 
decoupled from R. This can easily be shown to imply that A 
is maximally entangled with BS (see [11], or compare with 
the proof of Theorem |4] below). Because all transformations 
are unitary and the global state is pure, this ensures that Bob 
can apply a local isometry to reconstruct C, while at the same 
time obtaining the other half of the generated entanglement. 
This scenario is illustrated on the left of Figure [2] 

By simply reversing time (and also the labels A ^ B), 
a circuit for fully quantum Slepian-Wolf provides one for 
the fully quantum reverse Shannon theorem, which given a 
tripartite state |?/^)^^^, states that 

^AC\0 ^ ^ ^] ^ ij^c; A)[qq]+ h ^-^1^. 

A circuit for this is pictured on the right of Figure |2] 

We prove Theorem |3] as follows. If Bob's side information 
is considered as part of the reference (i.e. it is disregarded 
as side information), the fully quantum reverse Shannon 
protocol can be used to transfer C from Alice to Bob, at least 
making use of Alice's side information. By a modification of 
that protocol provided below, Bob's side information can be 
utilized to simulate the required coherent channels [q qq] 
as follows. Rather than choosing a single random unitary 
for the decoding, we choose exponentially many (roughly 
2n/(C;B) ^i^Q.^ apply the method of types to the one- 
shot result). We further guarantee that if Alice chooses one 
of the corresponding encodings uniformly at random. Bob 
can, on average, correctly distinguish that encoding in order 
to apply the correct decoding. Thus, it is possible for Alice to 
"piggyback" classical information on the transmitted qubits, 
which Bob can access by means of his side information 
(cf. [20], [21]). We further ensure that this can all be done 
coherently, where Alice instead applies a superposition of 
encodings by using a controlled isometry which is controlled 
by an arbitrary quantum state. The circuit we construct for 
performing this task noncoherently is illustrated in Figure |3] 
Our proof of Theorem |4] relies on two other lemmas. First, 
we require the operator inequality [22]: 
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Fig. 3. Circuit for using Bob's side information to piggyback extra classical 
(or coherent) information through the fully quantum reverse Shannon circuit 
on the right of Figure [2| 



Lemma 4: If < 11 < 1 and 11 < A, then 

1 - A-^/^nA-^/^ < 2(1 - n) + 4(A - n). 



We also will use the following coherification lemma, which 
allows us to convert protocols which transmit classical infor- 
mation to ones which simulate coherent channels. We give a 
short proof in the appendix. 

Lemma 5: Given a pure state and n unitaries 

IV^/c)^^ = Ukl'ip)^^. Given any other set of pure 
states IV^^)^^ and a POVM {Af } on D, there are complex 
phases ak such that the isometry 



-D^DK 



satisfies 



^f](A;K^|£|^;)>l-2(P+v^r^). 



where 



Proof of Theorem |4| As in the statement of the theorem, we 
fix nearby states \Lp) and \(\)) and let W^^^^ be any unitary 
decomposition of C into subsystems. Independently choose 
K unitaries {/7i, . . . , t/,^} according to the Haar measure on 
U{C). For each /c, define the states 



ASBBR 



W\^Pk) 



ACBR 



We define the decoupling fidelity for ip^^^ as 



Since |<|))^^|^)^<^^^ is a purification of tt^ (g) ip^^, 
Uhlmann's theorem implies that there is an isometry 
Yj^s^AAC ^nder which 

To send the message k, Alice will apply the isometry Vk = V^. 
We now define 

which is the state that is created after ^lice performs Vk and 
gives S to Bob, who then applies W^^^^^ . We may therefore 
equivalently write 

The average decoupling fidelity is a random variable 

1 

-^ave ~ ~ / , Fk 
k=l 

which depends on the random choice of unitaries. We lower 
bound its expectation as follows. Define the average states 
with respect to Haar measure dU as 



ACBR 



-ASBBR 



U{C') 

ACBR 



We now use the robust one-shot decoupling lemma 
(Lemma |3]) to bound the expectation of Fave over the random 
choice of unitaries: 



1 



1 




BBR 



A related estimate to be used later is 



1/4 



(17) 



which follows by concavity and the inequality + y < 
-Jx + y^, valid for y > 0. 

Next, we consider Bob's ability to distinguish the states 
"^'k^ . For this, we design a measurement which distinguishes 
the nearby states = Ukcj^^^Ul. Let 11 be the projection 
onto the support of (j)^^ and define 

Uk = UkUUl 

while defining the "pretty good measurement" 

k=l 
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The probability that this measurement fails to identify the state 



^/'f ^ is 



Observe that 



k 111 



< 2^/l-Fk + e 
Because x ^ ^Jx is concave, we have 



1 " 



Therefore, the average of the P/c can be bounded using 
Lemma |4] obtaining a random variable satisfying 

p = - V p^ 

ave — / , K, 
k=l 

1 

<l^ave + -^Tr(]l-Afc)(/)^^ 

/c=l 

1 

< Dave + - V (2(1 - IVEfc*/.^^) + 4 V Ttllk'^k'') 



k=l 



k'^k 



/c=l k'^k 



The last line holds because for each k.IVk projects onto the 
support of (j)^^ . By taking the expectation over the random 
choice of unitaries, this yields 



CB 



E Pave < E P>ave + 4a>: E Tr Hi^^ 

= EPave+4A^Tr [Eni(7r^ ) 



The second inequality is by (18), while the third is due to 
^Vl\ and holds for sufficiently small e, i.e. sufficiently large 
n. We may then conclude that for a particular value of the 
randomness, the same bound holds without the expectations. 
Finally, we define Bob's decoding isometry to be W = CW\ 
completing the proof. ■ 

IV. An operational proof of strong subadditivity 

Let l^^^c*^^ be an arbitrary pure state. In this section, we 
show how our results lead to an operational proof of strong 
subaddivity, i.e. that I{C;R\B) > 0. By discarding some 
resources on the right in Theorem [2j we obtain: 

^AC\B ^ ii(^C;RB)[q ^q]+ '^I{C;A)[qq] 

h '^nC;B)[q^q]. 
Intuitively, it makes sense that we should have 

/(C; RB) - I{C] B) = /(C; P|P) > 

since otherwise, a noiseless qubit channel could be used to 
faithfully transmit more than one qubit in the presence of 
entanglement between the sender and receiver. Of course this 
inequality is guaranteed by strong subadditivity. However, our 
aim is to provide an alternative proof of this fundamental 
inequality. The above asymptotic resource inequality implies 
that for every e, (5 > and all sufficiently large n, we have 

^L\L' ^ ^V^IiC,RB)+n5\ ^ ^] >^ 2L?^(^'^)J [q ^ q]. (19) 

^jL\L represents the prior entanglement between Alice and 
Bob, although its precise form is irrelevant for our argument. 
Now consider the following lemma, whose proof we delay 
until the end of this section. 

Lemma 6: Let K and Q be quantum systems and let 
|^^L|L arbitrary. Consider any potential simulation 



K^K ( K 



KL^P , 



< ED. 



of the identity quantum channel id 



K^K 



by the possibly 



< 2Ev/l--Fave + e- 



\C\ 



(18) 



We now apply Lemma [5] identifying AR E and CB ^ D. 
We can thus conclude that there is an isometry £CB^cbk 
under which 

- V {kMCm > 1 - 2(Pave + Vl-^ave). 

Taking expectations, we find that 

i_Eif](A;K^|/:|^;:) 



smaller one id*^^*^, assisted by the bipartite state j^*)^'^ . 
If |$fe)^ ^ is maximally entangled, then the entanglement 
fidelity [23] satisfies 

IQI 

(20) 



K 



< 2EVl-J^ave + 2EPave 

/ 4K||(/)^^|L||<i)-^|| 



< 6Ve + 4 



\C\ 



Plugging in to the right side of (20), we find that the 
entanglement fidelity is upper bounded by 2Lt^(^5^l^)+"^^J . 
Suppose now that strong subaddivity was not satisfied. Then, 
for some sufficiently small 5 > the entanglement fidelity 
would tend to zero exponentially fast. However, ([19]) implies 
that the entanglement fidelity can be made arbitrarily close to 
1. Therefore, we must have the inequality I{C; R\B) > 0. ■ 

Proof of Lemma^ Let {Ei} and {Dj} be Kraus matrices 
for the encoding g^^^Q and decoding X>^^'^^ . Fixing 
orthonormal bases of L and L' which Schmidt-decompose the 
assistance state as 



\s\' 



\c\ 



I*) 



L\L' 



L' 
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the above Kraus matrices can be written in block form 



En ' ' ' Ei\L\ 



Because these maps are trace-preserving, we have 

i 3 

which in turn impHes that 

i 3 

The overall map M^^^ has Kraus matrices {Nij} given by 
Nij = ^ \^XiDjiEii. 



The entanglement fidelity ( [20| can be written in the following 
form [23]: 



On the other hand, 

|2 



^|Tr7V,^-| =Y,M^^Dj^Eu\ 

ij iij 

<Y,MQ\i:^ElD],DjtEi, 



(22) 



= |Q|^A,Tr 
= IQI^A.TrU 

= \Q\-m. 



(23) 



Above, ( [22| ) is by the Cauchy- Schwartz inquality because 
rank(l)j^£^^^) < \Q\, while (23) follows from the identities 
( [2T] ). The last line holds because the squares of the Schmidt 
coefficients sum to unity. Combining this estimate with the 
previous expression for the entanglement fidelity proves the 
lemma. ■ 

V. Discussion 

We have identified the cost, in terms of entanglement and 
transmitted qubits, of moving a subsystem of a multipartite 
quantum state between two spatially separated parties when 
the sender and receiver each have quantum side information. 
Our strategy was to prove a new resource inequality which, 
when combined with other known results, implies the exis- 
tence of the optimal protocol. The minimal rate at which qubits 
must be sent provides the first known operational interpretation 
of quantum conditional mutual information. While operational 
interpretations of quantum mutual information are known [6], 
[24], these do not simply lead to one for the conditional quan- 
tity by naively subtracting mutual informations. An operational 
interpretation for a quantity is a proof that it is an optimal rate 
for performing some information processing task. Such a proof 



must consist of a protocol which achieves that rate, together 
with a converse demonstrating optimality, as was given in [1] 
for our situation. 

Our interpretation provides an explaination of the quadri- 
partite pure state identity ^I{C;R\A) = ^I{C; R\B) because 
the inherent reversibility of our protocol implies that the 
communication cost is the same in both directions. Indeed, 
with the exception of the Schumacher compression step, which 
is essentially reversible because it succeeds with high proba- 
bility, the protocol constructed to prove Theorem |3] consists 
entirely of isometrics. Moreover, the additional steps required 
to prove Theorem [T] do not introduce further nonunitarity. 
Throughout this paper, we have adhered to the convention of 
always conditioning on Bob's side information, although this 
was an arbitrary notational choice. We thus interpret quantum 
conditional mutual information as a measure of the quantum 
correlations between C and R, from the perspective of either 
party. Technically, our result provides an interpretation for one 
half of the conditional mutual information; nonetheless, we 
observed in [8] that by teleportation, we obtain a bona fide 
interpretation of conditional mutual information (i.e. without 
the 1/2) as the optimal classical communication rate for state 
merging [7]. 

Our protocol includes various others as special cases. When 
neither Alice nor Bob has any side information, or if they sim- 
ply disregard it so that is considered part of the reference, our 
protocol reduces to Schumacher compression - the technical 
components for this are contained in Theorem |5] When it is 
only Alice who has side information, our protocol reduces to 
the fully quantum reverse Shannon protocol, which itself can 
be proved by trivially combining Theorem [5] and the robust 
decoupling lemma (Lemma [3]). The same theorems can be 
shown to imply the fully quantum Slepian-Wolf theorem, in 
which it is only Bob who uses side information. As pointed 
out in [8], the formal time-reversal duality between fully 
quantum Slepian-Wolf and fully quantum reverse Shannon 
observed in [10] is embodied in a more natural way by our 
new protocol, which is in fact self-dual with respect to time 
reversal. In [8], we also observed the intuitively satisfying - 
but nonetheless surprising - fact that successive redistribution 
can be performed optimally using the optimal redistribution 
protocol. 

State redistribution solves the most general two-terminal 
source coding problem that can be considered for the class 
of "fully quantum" coding problems for i.i.d. quantum states. 
Because the main technical part of our proof is proved in 
a one-shot fashion, it is likely that it could be applied to 
more general quantum sources that do not satisfy the i.i.d. 
property but which do have some internal regularity structure; 
for instance, to ground states of many-body Hamiltonians in 
statistical physics. However, it would perhaps be most useful 
for such applications to have a more direct proof of Theorem [T] 
which does not use coherent channels or the cancellation 
lemma. While it would be most desirable to have a one-shot 
version of Theorem [T] it would perhaps be most natural to 
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Fig. 4. Potential one-shot redistribution circuit making time-reversal sym- 
metry apparent. 

find a one-shot version of the related resource inequality 

^^\BC ^i_I^C;B)[qq]. 

The corresponding circuit for this case makes the time-reversal 
symmetry most apparent, as illustrated in Figure [4] 

We expect state redistribution to be a useful primitive for 
studying more complicated state transfer problems. Most gen- 
erally, one can imagine n spatially separated parties all holding 
various parts of a global multipartite state, wishing to shuffle 
their subsystems around in some arbitrary but predetermined 
way. There is a multitude of ways in which redistribution could 
be applied to give achievable rate regions for such problems, 
where each round of communcation would fit our general 
setting, although they would most likely be suboptimal in 
general. A simple example along these lines, for which the 
optimal solution is not yet known, was considered in [25], 
where Alice and Bob wish to swap two systems. We expect 
that judicious use of state redistribution should lead to new 
achievable rates for this problem, by optimizing over ways of 
splitting the systems to be swapped into subsystems. 

During the proof of our main result, we found that in some 
cases where strong subaddivity is satisfied /(C; R\B) = 0, our 
protocol nonetheless requires a nonzero amount of quantum 
communication. One cannot hope for much better, as the 
following example illustrates. Suppose that Alice wishes to 
redistribute to Bob the C parts of many copies of an arbitrary 
product state |?/^)^^|?/^)^^. Alice can simply throw away 
the states {ip)^^ and then dilute [26] her entanglement with 
Bob to create those states nonlocally. Despite the fact that 
that strong subaddivity is saturated on the global state, it is 
nonetheless known [27], [28] that in general, entanglement 
dilution requires a sublinear amount of communication. 

It has been apparent that one half of the mutual information 
plays a central role in characterizing the optimal rates in this 
paper. In the following, somewhat mysterious fashion, this 
quantity can be considered as a "measure" of the correlations 
between two subsystems. By analogy with thermodynamics. 



it is possible to identify an underlying heuristic organizing 
principle governing our optimal rates which perhaps could 
lend itself to further generalizations of redistribution. The 
main task of state redistribution is to transform between two 
configurations of the subsystems as follows: 

AC\B\R^ A\CB\R. 

Let Anitiai/finai (resp. B) dcnotc the systems Alice (resp. 
Bob) holds at the beginning/end of the protocol. Consider 
the following "dynamic potentials" relative to Alice^Bob 
communication: 

A'^itilf = |^(^; initial) = ll{R;AC) 

D^-^^ = ^liR'^AMal) = ^I{R',A). 

We interpret these as indicating the correlations between 
Alice's systems and the reference, both before and after 
redistribution. The optimal qubit rate for redistribution is easily 
shown to equal the difference between the dynamic potentials 

D^n^" - A^maf = '2nC;R\A) = \I{C;R\B). 

We are therefore operationally justified in interpreting this 
difference as measuring the correlations with the reference 
that Alice must transfer to Bob to redistribute the state. 
Analogously, we may also define "static potentials" 

^Mtial^ = ^-^( ^initial 5 ^initial) = ^^(^^5^) 
^tn^^ = ^^(-4finai; Sfinal) = \l{A]CB) 

which indicate the correlations between Alice's and Bob's 
systems at each state of redistribution. Similarly, the optimal 
ebit rate can be shown to equal the difference of the static 
potentials 

~ ^Mtilf = ^^(^5 C) — \I{B] C). 

It is operationally justifiable to consider this difference as the 
amount of excess correlation between Alice and Bob that is 
involved in going between the two configurations. 

Relative to the Bob^Alice direction, the dynamic potentials 
are subtracted from a constant 

^MtiaMnal — ^{^) ~ ^initiaVfinal 

while the static potentials obey 

qA^B ^ qB^A 

^ initial/ final final/ initial • 

Subtracting these potentials as above, we find that 

nB^A _ nB^A _ rtA^B _ T^A^B 
^final ^initial — ^final ^initial 5 

while 

nA^B _ nA^B _ _( qB^A _ qB^A\ 
^final ^initial — Wfinal ^initial )^ 

providing another explaination of the symmetry properties of 
the optimal rates. One could easily imagine generalizations of 
the above in which more complicated potentials are defined 
for redistribution problems involving many more parties. How- 
ever, we expect it would quite challenging to find operational 
justifications for such a theory. 
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Appendix 

Here we collect the proofs of some auxilliary results used 
in the proof of Theorem |4] Our proof of the robust decoupling 
lemma (Lemma [3]) relies on the following non-robust version. 

Lemma 7 (One-shot decoupling): Let a density matrix 
(^^^ be given and fix a unitary decomposition W^^^^ of 
C into subsystems. For each unitary U^^^, define 



Then 



/ 

Ju 



U{C) 



Wu 



(24) 



Proof of Lemma |j| By convexity of the trace norm 



■ TT^ (g) 



< 



U{C) 



where dU is Haar measure on U{C). We use the triangle 
inequality to bound the integrand: 



(25) 
(26) 



+ ||7r'^(8)V''^-7r^(8)y)-^||j. (27) 



'^U 111 



The second term is bounded using monotonicity, unitary 
invariance of the trace norm, and the assumed e-closeness of 

ip^^ and (f^^: 



BE 
U 



111 < 



,^SBE, 
lCE 



< e. 

Similarly, the last term satisfies 



(28) 



< e. 



1 

CE\\ 



Because x 
satisfies 



is convex, the integral of the first term 



JuiC) 



< 



I 

JU{C) 



The theorem follows by applying Lemma |7] to this integral. 



Proof of Lemma |5[ To begin, note that we may choose the 
complex phases so that {k\{ilj\C\il)'^) = |(/c|(?/;|>C|V^^)|. Now 

>\{k\{mi'k)\'-\mk)-jci^'k)i- 

Because < A/^ < 1, we have 

Furthermore, unitary invariance of the trace norm and Q 
imply that 



Therefore, 

{kmcm > (Tr^fcAfe)' - 2y'l-|(V'fe|V^)|2. 

and X ^ —\fx are 

l^{fcl(^l£l^^)>(i-p)2-2^/r^ 



Finally, because the functions x 
convex, we find that 



/e=l 



as required. 
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